Conceptual Grouping in Word Co-Occurrence Networks

نویسندگان

  • Anne Veling
  • Peter van der Weerd
چکیده

Information Retrieval queries often result in a large number of documents found to be relevant. These documents are usually sorted by relevance, not by an analysis of what the user meant. If the document collection contains many documents on one of those meanings, it is hard to find other documents. We present a technique called conceptual grouping that automatically distinguishes between different meanings of a user query, given a document collection. By analysing a word cooccurrence network of a text database, we are able to form groups of words related to the query, grouped by semantic coherence. These groups are used to reorganise the results according to what the user has meant by his query. Testing shows that this automated technique can improve precision, help users find what they need more easily and give them a semantic overview of the document collection. 1 I n t r o d u c t i o n

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The analysis of co-citation and word co-occurrence networks of Iranian articles in the field of dentistry

Background and Aims: Dentistry is an important profession ensuring the health of body and soul, and has a special place in the scientific productions of medical disciplines. The purpose of this study was to analyze the co-citation and word co-occurrence of Iranian research papers in the field of dentistry based on indexed documents in Web of Science from 2014 to 2018. Materials and Methods:...

متن کامل

Survey of Word Co-occurrence Measures for Collocation Detection

This paper presents a detailed survey of word co-occurrence measures used in natural language processing. Word co-occurrence information is vital for accurate computational text treatment, it is important to distinguish words which can combine freely with other words from other words whose preferences to generate phrases are restricted. The latter words together with their typical co-occurring ...

متن کامل

Drawing Word co-occurrence map of Spinal Muscular Atrophy disease

Introduction:  The purpose of this article is to evaluate the status of articles in the field of Spinal Muscular Atrophy According to the Scientometrics indices Word co-occurrence map of this field . Methods: The present study is an applied one with a quantitative approach and a descriptive approach. It has been done using scientometrics and the co-occurrence words analysis technique. Document...

متن کامل

Global topology of word co-occurrence networks: Beyond the two-regime power-law

Word co-occurrence networks are one of the most common linguistic networks studied in the past and they are known to exhibit several interesting topological characteristics. In this article, we investigate the global topological properties of word co-occurrence networks and, in particular, present a detailed study of their spectrum. Our experiments reveal certain universal trends found across t...

متن کامل

Choosing the Word Most Typical in Context Using a Lexical Co-Occurrence Network

This paper presents a partial solution to a component of the problem of lexical choice: choosing the synonym most typical, or expected, in context. We apply a new statistical approach to representing the context of a word through lexical co-occurrence networks. The implementation was trained and evaluated on a large corpus, and results show that the inclusion of second-order co-occurrence relat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999